Add plugin registry and callbacks for AI model validation #26309

stefanberger · 2025-10-06T16:52:43Z

Purpose

This PR adds a plugin registry for AI model validation plugins and sets callbacks from which the plugins are invoked. The model validation can be used on AI models and LoRA adapters and therefore the plugin points are set to verify:

AI models available in the filesystem
AI models that are downloaded from Huggingface hub
LoRA adapters when they are loaded

The first plugin to use this new infrastructure will be used for integrity and provenance verification of AI models and LoRA adapters and will be hosted outside the vLLM repository.

Test Plan

The following new tests have been added:

pytest tests/v1/engine/test_engine_core_model_validation.py
pytest tests/model_executor/model_loader/test_model_validation.py

The following existing test have been extended:

pytest tests/lora/test_lora_manager.py

Test Result

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

stefanberger · 2025-10-06T16:54:31Z

@njhill FYI

gemini-code-assist

Code Review

This pull request introduces a plugin registry for AI model validation, allowing for integrity and provenance checks on models and LoRA adapters. Callbacks are added at various points in the model and adapter loading process to invoke these validation plugins. The changes span across model loaders, LoRA management, and the V1 engine core, with corresponding tests to ensure the validation mechanism is triggered correctly. The error reporting for LoRA loading failures is also improved. My review found a critical issue in the GGUF model loader that would prevent loading models from a URL.

vllm/model_executor/model_loader/gguf_loader.py

Add a model validation plugin registry where classes implementing the ModelValidationPlugin interface can be registered. Enable the validating on local models that have already been downloaded by the user. Add a test case with an already downloaded model whose config.json is unmodified so that a ModelConfig can be created from it. Signed-off-by: Stefan Berger <[email protected]>

Extend a LoRARequest with a validate() method to enable validation of a LoRA adapter when it is loaded. Add a test case. Signed-off-by: Stefan Berger <[email protected]>

Implement a method 'validate' in the BaseModelLoader that first checks whether any plugin requests to validate the given model and then possibly downloads all the model files, including the signature. For this, query the subclass of BaseModelLoader for its download type. Support validation of local models and those downloaded from Huggingface Hub. Add a test case. Signed-off-by: Stefan Berger <[email protected]>

Extend the reporting of an error over RPC by the cause of the error if it is known. This then for example not only reports that the signature verification failed but also the reason, such as when an unsigned file was found. Signed-off-by: Stefan Berger <[email protected]>

…aded The reason why a LoRA adapter could not be loaded may include information from model validation, such as that model signature verification did not succeed because unsigned files were found. Signed-off-by: Stefan Berger <[email protected]>

stefanberger requested review from jeejeelee, WoosukKwon, robertgshaw2-redhat, njhill, ywang96, comaniac, alexm-redhat, aarnphm, chaunceyjiang and 22quinn as code owners October 6, 2025 16:52

mergify bot added frontend v1 labels Oct 6, 2025

gemini-code-assist bot reviewed Oct 6, 2025

View reviewed changes

vllm/model_executor/model_loader/gguf_loader.py Outdated Show resolved Hide resolved

stefanberger added 5 commits October 6, 2025 22:43

Extend a LoRARequest to support model validation

6d3aae5

Extend a LoRARequest with a validate() method to enable validation of a LoRA adapter when it is loaded. Add a test case. Signed-off-by: Stefan Berger <[email protected]>

stefanberger force-pushed the validation-plugin.4upstream branch from 0087ecd to a150791 Compare October 6, 2025 22:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Add plugin registry and callbacks for AI model validation #26309

Add plugin registry and callbacks for AI model validation #26309

stefanberger commented Oct 6, 2025 •

edited by github-actions bot

Loading

Uh oh!

stefanberger commented Oct 6, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Add plugin registry and callbacks for AI model validation #26309

Are you sure you want to change the base?

Add plugin registry and callbacks for AI model validation #26309

Conversation

stefanberger commented Oct 6, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Test Plan

Test Result

Uh oh!

stefanberger commented Oct 6, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

stefanberger commented Oct 6, 2025 •

edited by github-actions bot

Loading